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Amendments to the Claims: 



1. (Currently Amended). A method for calculating the similarity of at least one chemical 
compound to at least one chemical probe, the at l e ast one chemical probe including at least 
another chemical compound, th e method comprising the steps of: 

(a) creating utilizing at least one chemical descriptor for each compound in of a coll e ction 
plurality of compounds , each descriptor comprising a row of a molecule-descriptor matrix X \ 

(b) representing at least one ch e mical descriptor for each compound as at least one a 
column v e ctor comprising of the at least one molecule- descriptor fr e quencies matrix, the 
entries of the molecule-descriptor matrix comprising a mathematical function of the weighting 
of each descriptor for each compound ; 

(c) representing th e collection of compound the at l e ast one vector as a first vector of a 
molecule descriptor matrix; 

(d) performing a singular value decomposition (SVD) of the molecule-descriptor matrix to 
produce at l e ast one singular matrix resultant matrices ; 

(e) £d] generating at least one creating a chemical probe descriptor vector for the at least 
one chemical probe , the entries of the chemical probe descriptor vector comprising a 
mathematical function of the weighting of each descriptor for each chemical probe ; 

(f) (e) using the at least one singular matrix of the resultant matrices to transform calculate 
the at least one chemical probe descriptor of the at least on e chemical probe into a first 
coordinate system at least substantially similar to a second coordinate system of the at least one 
compound; similarity between the at least one chemical probe and at least one compound of 
the molecule descriptor matrix; and 

(g) calculating th e similarity of transform e d probe s to the compounds in the collection, and 
fe> (f) outputting a list of at least a subset of compounds in the coll e ction rank e d in order of 

providing an output indicating the similarity te between the at least one chemical probe and the 
at least one probe compound . 
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2. (Currently Amended). The method as recited in claim 1, wherein said step of cr e ating each 
of the at least one descriptor includ e s generating chemical descriptors comprise at least one of an 
atom pair descriptor and a topological torsion descriptors from chemical conn e ction tabl e s of the 
coll e ction of compounds descriptor . 

3. (Canceled). 



4. (Currently Amended). The method as recited in claim 1, wherein said molecule descriptor 
matrix is denoted as X, wh e rein said step of performing singular valu e d e composition includes 
step comprises the steps of: 

generating singular resultant matrices as [[X = PUQ T ]] of rank r P, Z and O 1 \ such that 

molecule-descriptor matrix X = PSO T . wherein 

P is a mxr matrix, called the left singular matrix (r is the rank of X), and its 
columns are the eigenvectors of XX T corresponding to nonzero eigenvalues; 

0 is a nxr matrix, called the right singular matrix, whose columns are the 
eigenvectors of X X corresponding to nonzero eigenvalues; and 

Lis a rxr diagonal matrix whose nonzero elements, a ^ <T 7 tr r called singular 

values, are the square roots of the eigenvalues and have the property that a , > a ? > . . . > o- r ^-aftd: 

a r e duced dimension approximation of X x defin e d as X tr^J^A^ fc k « r > where., P and Q ar e 
the left and right singular matrices representing correlation s among descriptors and compounds 
resp e ctively, and ^repre s ents the singular valu e s, 

wherein the at least one produced singular matrix includes a pseudo object denoted as Op -and 
is calculat e d from a probe F by Op = F^ ^^fe-raftd 
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wherein said step of calculating the similarity betw e en the pseudo obj e ct Op and the 
compounds in coll e ction is computed by taking a dot product of a normaliz e d v e ctor of Op, with 
e ach normaliz e d row of [[/VI]. 

5. (Currently Amended). The method as recited to claim 4, wherein said similarity 
calculating st e p includes calculating cosine betw e en each pair of vectors between the at least one 
chemical probe and at least one compound of the molecule descriptor matrix is computed in 
accordance with the following steps: 

generating a reduced dimension approximation of X defined as Xy = PiAQV k < r; 

generating a pseudo-object, denoted as Of, where Of = F T P*£ V, and where F is a molecule- 
descriptor vector for the at least one chemical probe; and 

taking a dot product of Of with one or more columns of 0 T w respectively corresponding to 
the at least one compound . 

6. (Currently Amended). The method as recited in claim [4] 5, wherein said step of 
performing singular value decomposition includ e s deriving the reduced dimensional 
approximation of X by setting the k + 1 through r singular values of [[2]] oi + i to o yare equal to 
zero for 2* . 

7. (Currently Amended). The method as recited in claim [4] 5, wherein similarities of the 
pseudo object to compounds in the collection is calculated by setting th e first k singular values 
®f ex; to are equal to one for 2* . 

8. (Currently Amended). The method as recited in claim 7, wherein 2* said setting step 
includes using comprises an identity matrix /. 



9.-21. (Canceled). 
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22. (New). The method according to claim 1, wherein in said representing step the weighting 
comprises a frequency. 

23. (New). The method according to claim 22, wherein in said creating step the weighting 
comprises a frequency. 

24. (New). The method according to claim 5, wherein X k is the least squared closest to X. 

25. (New). A method for calculating the similarity between two chemical descriptors 
represented in a molecule-descriptor matrix X, comprising the steps of: 

(a) utilizing at least one chemical descriptor for each of a plurality of compounds, each 
descriptor comprising a row of the molecule-descriptor matrix X\ 

(b) representing each compound as a column of the molecule-descriptor matrix, the entries 
of the molecule-descriptor matrix comprising a mathematical function of the weighting of each 
descriptor for each compound; 

(c) performing a partial singular value decomposition (SVD) of the molecule-descriptor 
matrix to produce resultant matrices 2k, and Q k , such that partial matrix Xk = PhZtCfk , is the 
least squares closest approximation of X for any given k, k < r, where r is the rank of the matrix 
X\ 

(d) calculating the similarity between two chemical descriptors i and j by computing the dot 
product between row i and row j of the matrix P*; and 

(e) providing an output indicating the similarity between the two descriptors. 
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26. (New). A method for calculating the similarity of at least one chemical descriptor and at 
least one chemical probe, comprising the steps of: 

(a) utilizing at least one chemical descriptor for each of a plurality of compounds, each 
descriptor comprising a row of a molecule-descriptor matrix X\ 

(b) representing each compound as a column of the molecule-descriptor matrix, the entries 
of the molecule-descriptor matrix comprising a mathematical function of the weighting of each 
descriptor for each compound; 

(c) performing a partial singular value decomposition (SVD) of the molecule-descriptor 
matrix to produce resultant matrices and such that partial matrix X* = Pk^&k > is the 
least squares closest approximation of X for any given k, k < r, where r is the rank of the matrix 
X; 

(d) creating a chemical probe descriptor vector, Fi, for the at least one chemical probe, the 
entries of the chemical probe descriptor vector comprising a mathematical function of the 
weighting of each descriptor for each chemical probe; 

(e) creating a pseudo-object 0 F i, where Ofi = F\ T P k £~ l k ; 

(f) calculating the similarity between Ofi and at least one chemical descriptor / by 
computing the dot product between row i of matrix P k and Ofi; and 

(g) providing an output indicating the similarity between the chemical probe and the at 
least one chemical descriptor. 



Page 7 

Serial No. 09/546,399 

Attorney Docket No.: 105456-101US1 

27. (New). A method for calculating the similarity of at least one chemical probe, Fj, and at 
least one other chemical probe, F2, comprising the steps of: 

(a) utilizing at least one chemical descriptor for each of a plurality of compounds, each 
descriptor comprising a row of a molecule-descriptor matrix X\ 

(b) representing each compound as a column of the molecule-descriptor matrix, the entries 
of the molecule-descriptor matrix comprising a mathematical function of the weighting of each 
descriptor for each compound; 

(c) performing a partial singular value decomposition (S VD) of the molecule-descriptor 
matrix to produce resultant matrices Pk, £k, and <2k, such that partial matrix X* = Pk^^k , is the 
least squares closest approximation of X for any given k, k < r, where r is the rank of the matrix 
X; 

(d) creating a chemical probe descriptor vector for at least one chemical probe, Fi, the 
entries of the chemical probe descriptor vector comprising a mathematical function of the 
weighting of each descriptor for each chemical probe; 

(e) creating a pseudo-object Ofi, where Ofi = F\ T P^£' 1 ^ 

(f) creating a pseudo-object Or, where Of2 = F 2 r Pk£ ~\\ 

(g) calculating the similarity between the chemical probe descriptor vector Fi and the other 
chemical descriptor probe F 2 by computing the dot product between Ofi and Of2; and 

(h) providing an output indicating the similarity between the chemical probe, Fi, and 
chemical probe, F2. 
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28. (New). A method for calculating the similarity of at least one chemical compound to at 
least one chemical probe, comprising the steps of: 

(a) creating at least one descriptor for each compound in a text source; 

(b) representing each compound in the text source as at least one mathematical function of 
the weighting of each descriptor for each compound; 

(c) creating a chemical probe descriptor for the at least one chemical probe, the entries of 
the chemical probe descriptor comprising a mathematical function of the weighting of each 
descriptor for each chemical probe; 

(d) using the text source and the chemical probe descriptor for the at least one chemical 
probe to calculate a mathematical similarity between the text source and the chemical 
descriptor for the at least one chemical probe; and 

(e) providing an output indicating the similarity between the at least one chemical probe 
and the at least one chemical compound. 

29. The method as recited in claim 2, wherein each of the at least one chemical descriptors 
further comprise at least one of a charge pair descriptor, a hydrophobic pair descriptor, an 
inherent atom property descriptor, and a geometry descriptor. 



